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Abstract. One of the fundamental problem in the theory of sorting is 
to find the pessimistic number of comparisons sufficient to sort a given 
number of elements. Currently 16 is the lowest number of elements for 
which we do not know the exact value. We know that 46 comparisons suf- 
fices and that 44 do not. There is an open question if 45 comparisons are 
sufficient. We present an attempt to resolve that problem by perform- 
ing an exhaustive computer search. We also present an algorithm for 
counting linear extensions which substantially speeds up computations. 



1 Introduction 

We consider sorting by comparisons. One of the fundamental problem in that 
area is to find the pessimistic number 5(n) of comparisons sufficient to sort n 
elements. Steinhaus posed this problem in [8]. Knuth considered it in pQ. From 
the information-theoretic lower bound, further denoted by ITLB, we know that 
5(n) > [log 2 n!] = C(n). Ford and Johnson discovered [2] an algorithm, further 
denoted by FJA, which nearly and sometimes even exactly matches C(n). Let 
F(n) be the pessimistic number of comparisons in the FJA. It holds S(n) = 
F(n) = C{n) for n < 11 and n = 20,21. The FJA does not achieve the ITLB 
for 12 < n < 19 and infinitely many n > 22. Carrying an exhaustive computer 
search, Wells discovered in 1965 [9110] that the FJA is optimal for 12 elements 
and 5(12) = F(12) = C(12) + l = 30. Kasai et al. [3] computed 5(13) = F(13) = 
C(13) + 1 = 34 in 1994, but that result was not widely known. It was discovered 
again a few years later [5], independently, extending the Wells method. Further 
improvement of the method led to show in years 2003-2004 |6I7] that it holds 
S(n) = F(n) = C(n) + 1 for n = 14, 15, 22, similarly. 

In this paper we consider the case n = 16. This is now the lowest number of 
elements for which we do not know the exact value of S(n). The previous results 
could suggest that 5(16) = F(16) = C(16) + 1 = 46. However Knuth conjectures 
that 5(16) = C(16) = 45. He does not believe that the FJA is optimal for 
16 elements. He wrote [1]: "There must be a way to improve upon this!" We 
present recently obtained resulttQ aiming to compute the value of 5(16). It is 



1 The results presented in this paper are obtained using computer resources of the In- 
terdisciplinary Centre for Mathematical and Computational Modelling (ICM), Uni- 
versity of Warsaw. 
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very unlikely that someone will find it by pure theoretical consideration. It seems 
that the only promising way leads by performing an exhaustive computer search 
supported by cleaver heuristic. 

The paper is organized as follows. In Sect. [5] we introduce notation used 
throughout the paper. In Sect. [3] we briefly describe the algorithm we use to 
resolve if there exists a sorting algorithm for a given number of elements and 
comparisons. We analyse why the ITLB is not achieved for 13, 14 and 15 elements 
in Sect. 01 We present the newest results for 16 elements in Sect. El In Sect. [6] 
we compare the computation complexity of the previous cases and the case of 
16 elements. Finally, in Sect. [71 we present the algorithm for counting linear 
extensions which substantially improves the algorithm from Sect. [3] 

2 Notation 

We denote by U — {uq, u\, . . . , u„_i} an n-element set to be sorted. Sorting of 
the set U is represented as a sequence of posets (P c — (U, R c ))c=o,i,...,c, where 
R c is a partial order relation over a set U . Sorting starts from the total disorder 
Pq = (U,Rq), where Rq = {{u, u) : u <= U}. After performing c comparisons we 
obtain a poset P c = (U, R c ). Sorting should end with a linear order Pq. Assume 
that elements Uj and Uk are being compared in step c. Without loss of generality 
we can assume that (uj, Uk) $ R c -i and (v,k, Uj) £ R c -i- Suppose the answer to 
the comparison is that element Uj is less than element Uk- Then we obtain the 
next poset P c = (U,R C ), where the relation R c is the transitive closure of the 
relation R c -\ U {(iij,Uk)}- We denote this by P c = P c -\ + ujUk- 

By e(P) we denote the number of linear extensions of a poset P = (U,R). 
We assume that e(P + UjUk) = e(P) and e(P + UkUj) = if elements Uj, Uk are 
in relation, i.e., if (v,j,Uk) S R- 

3 The Algorithm 

In this section we remember briefly the algorithm which answers if sorting of a 
given poset Pq can be finished in C comparisons. The algorithm was invented 
in [9110] and improved in [5] and later in [6]. We present the next improvement 
to the algorithm in Sect. [Jj The algorithm has two phases: forward steps and 
backward steps. 

In the forward steps we consider a sequence of sets (<S c )c=o,i c- The set S$ 

contains only the poset Pq. In step c we construct the set S c from the set 5 c _i. 
Every poset P G 5 C — l is examined for every unrelated pair (v,j,Uk) in order to 
verify whether it can be sorted in the remaining C — c + 1 comparisons. As the 
result of the comparison of Uj and Uk one can get one of two posets P± = P+UjUk 
or Pi = P + UkUj. If the number of linear extensions of Pi or P2 exceeds 2 C ~ C 
then by the ITLB it cannot be sorted in the remaining C — c comparisons. It 
follows that in this case, in order to finish sorting in C — c + 1 comparisons, 
elements Uj and u/~ should not be compared in step c. If the number of linear 
extensions of both Pi and P 2 do not exceed 2 C ~° then we store one of them in 
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the set S Cl namely that with greater number of linear extensions. If both have 
the same number of linear extensions we choose P± arbitrarily. We do not store 
isomorphic posets or a poset which dual poset is isomorphic to some already 
stored poset. 

If some set S c in the sequence appears to be empty then we conclude that the 
poset Po cannot be sorted in C comparisons. Such results are received for 12 and 
22 elements and C = C (n) [6] , where the set 1S23 and S40 is empty, respectively. 
Wells reported [10] that for n = 12 only the set £24 is empty. Those results mean 
that S(n) > C(n) for n = 12, 22. If the set Sc is not empty after performing 
forwards steps, we cannot conclude about sorting of the poset Po- In that case 
we continue with backward steps. 

In the backward steps we consider the sequence of sets (5*) c =o,i....,C- We 
start with the set S c = Sc which contains only a linear order of the set U. In 
step c, where c = C — 1, C — 2, . . . , 0, we construct the set S* from the set S* +1 . 
The set S* is a subset of the set S c and contains only posets which can be sorted 
in the remaining C — c comparisons. Poset P G S c is stored in S* iff there exists 
in P a pair of unrelated elements (uj, Uk) such that poset P\ = P + ujUk or poset 
P2 = P + UkUj belongs to the set S* +1 (as previously we identify isomorphic and 
dual posets) and both posets are sortable in C — c — 1 comparisons. Therefore 
we store the poset P in the set S* iff both Pi, P 2 G S* +1 or P 1 G S* +1 and P 2 is 
sortable in C — c — 1 comparisons or P 2 G S* +l and Pi is sortable in C — c — 1 
comparisons. Sortability of Pi or P2 can be checked recursively using the same 
algorithm. 

If some set S* in the sequence appears to be empty then we conclude that 
the poset Pq cannot be sorted in C comparisons. On the other hand, if the set 
<Sq is not empty, it contains the poset Po and we conclude that the poset Po can 
be sorted in C comparisons. For n — 13, 14, 15 and C = C(n) we received that 
the set <Si 5 is empty [5 6 7, which means that S(n) > C(n) for n = 13, 14, 15. 
We analyze those results in detail in the next section. 

4 The Previous Cases 

The computer experiment for n — 13 and C = C(n) returns that the set S^ 5 is 
empty, which means that 5(13) = F(13) = C(13) + l = 34 5 . In that experiment 
the set 5i 6 contains only one poset Pi6, whose Hasse diagram is shown in Fig. 
[T] The poset Pi6 can be obtained from a poset contained in the file 1S15 in two 
ways: 

— we compare elements uq and u\o in the poset P{ 5 G 5i5 shown in Fig. [2j 
if Mo > itio we obtain the poset Pi6; if wo < uiq we obtain the poset Q' 16 
shown in Fig. [3J 

— we compare elements uq and u§ in the poset P" 5 G 5is shown in Fig. if 
uq < uq we obtain the poset Pig ; if uq > uq we obtain the poset Q'{ 6 shown 
in Fig. El 

Neither the poset P[ 5 nor the poset P" 5 can be stored in the file S* 5 , because 
neither the poset Q' 16 nor the poset Q'{ 6 can be sorted in the remaining C — 16 = 
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Fig. 1. The poset P 16 , e(P 16 ) = 113400 




Fig. 2. The poset P{ 5 , e(P{ 5 ) = 222750 




Fig. 3. The poset Q' 16 , e(Q' 16 ) = 109350 
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17 comparisons. It is quite surprising that the posets Q'i & , Q'{ 6 cannot be sorted. 
The poset Q' 16 has less linear extensions than the poset Pie, which intuitively 
should make it easier to sort. Indeed, the poset Q'{ 6 has more linear extensions 
than the poset Pie, which intuitively makes it harder to sort. On the other hand, 
there are known the two largest elements of the poset Q'{ 6 , which intuitively 
makes it easier to sort. The poset P\e is sortable in 17 comparisons because of 
its symmetry. 




Fig. 4. The poset P{' 5 , e(P 1 " 5 ) = 238140 




Fig. 5. The poset Q'{ 6 , e(Q'/ 6 ) = 124740 

Similar results were received in the computer experiments for n = 14, 15 and 
C = C{n), i.e., S(U) = F(U) = C(14) + 1 = 38 and 5(15) = F(15) = C(15) + 
1 = 42. In both cases the file Si 6 contains only one poset, namely the poset 
Pie extended by one isolated element M13 (for n = 14) or two isolated elements 
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U13, U14 (for n = 15), respectively. In both cases the file Si 5 is empty and the 
reason is the same. The posets P{ 5 , Q'{ 5 , Q' 16 , Q'{ 6 extended by u%3 or u%3, U14 
are observed, respectively, and neither Q' w nor Q'{ 6 is sortable in the remaining 
C — 16 comparisons. Note that for n = 14 we have C — 16 = C(n) — 16 = 21 and 
for n = 15 we have C — 16 = C(n) — 16 = 25. 

5 The Case of 16 Elements 

In this section we describe an attempt to find for n = 16 a sorting algorithm 
better than the FJA or to exclude existence of such algorithm. Before starting a 
long time computation it was checked if the scenario from the previous section 
repeats for n = 16. The posets Q' 16 , Q'{ 6 were extended by three isolated elements 
U13, U14, U15. As previously, the experiment returned that neither the poset Q' le 
nor the poset Q'{ 6 can be sorted in the remaining C — 16 — C(n) — 16 = 29 
comparisons. Of course, this result does not exclude the existence of the desired 
algorithm. 

To find the exact value of 5(16) the algorithm from Sect. [3] with improvement 
from Sect. [7] is applied. Because the search space is very reach, the problem is 
divided into smaller subproblems. Let T(k) be the number of elements which 
were compared (touched) by a sorting algorithm in the first k comparisons. 
Observe that T(ki) < T{ki) for k% < &2- A sorting algorithm for 16 elements, 
using at most C(16) = F(16) — 1 = 45 comparisons, is examined for possible 
values of T(k). 

The first experiment returned that if 5(16) = 45 then it holds T(15) < 16. 
Note that for the FJA we have T(k) — 16 for k > 8. Hence a hypothetical 
algorithm, using for 16 elements pessimistically less comparisons than the FJA, 
must be complete different from the FJA. It must differ from the FJA already 
before the 9th comparison. This is quite surprising, when we look at regular 
structure of the first 15 comparisons in the FJA. The next experiment showed 
that if 5(16) = 45 then T(15) > 11, which is already not surprising. 

6 Computation Complexity 

Computation complexity of the method groves exponentially. The case 5(13) 
needed in year 2002 [3] more than 10 hours of CPU time. The value of 5(14) 
was computed one year later (published in 2004 [5]) and took about 392 hours on 
faster computer and using improved algorithm, which could solve 5(13) in about 
40 minutes. Further progress in hardware allowed to compute the value of 5(15) 
in year 2004 (published only in 2007 [7]) using about 17500 hours of CPU time. 
Each next case required significant improvements in the algorithm or hardware. 
The progress is presented in Table [TJ One can argue that the comparison is not 
fair, because the machines used in the experiments are different. The purpose 
of this table is to show an overall improvement in software and hardware, and 
to give a filling, how difficult the case of 16 elements could be. The about 10 
times improvement observed between the second last and the last column is due 
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mainly to the algorithm described in the next section. Note that for Core 2 Due 
processor both cores were used in parallel. 



Table 1. Computation times 



n 


Pentium II 


Pentium III 


Opteron 246 


Core 2 Duo 




233 MHz 


650 MHz 


2 GHz 


2.13 GHz 




2002 


2003 


2004 


2007 


13 


10 hr. 30 min. 


41 min. 


10 min. 44 sec. 


46 sec. 


14 




391 hr. 37 min. 


44 hr. 10 min. 


4 hr. 31 min. 


15 






17554 hr. 





A few years of CPU time was used up to now to search for an algorithm 
achieving the ITLB for n = 16. The computation that T(15) < 16 and T(15) > 
11 took about 20000 and 7000 hours, respectively. Computation for the next case 
T(15) = 12 is currently in progress. It used up to now more than 11000 hours. 

7 Counting Linear Extensions 

The most time consuming part of the algorithm presented in Sect. [3] is counting 
linear extensions of a given poset. In this section we describe the algorithm 
for counting linear extensions which is inspired by [T] and which substantially 
improves computations. For a given poset P = (U, -<) the algorithm computes 
e(P) and the table t[j, k] — e(P + UjUu) for j ^ k. 



{U0,Ul,U2, 113} 








Fig. 6. A poset and the graph of its downsets 
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d{U) = 5,u(U) = 1 




d(«0,Wl,W3) = 3, i/(uo, 1/1,1/3) = 1 



d(iio, 1/1,1/2) — 2,1/(1/0,1x1,1/2) — 1 



d(i/i,w 3 ) = 1,1/(1/1,1/3) = 1 



d(uo,ui) = 2, 1/(1/0,1/1) = 2 



d(ui) = 1, u(wi) = 3 



d(uo) = 1, u(i/o) = 2 



Fig. 7. The numbers of linear extensions of the downsets and they complemen- 
tary sets 

Let P — (U,-<) be a poset. A subset D C U is called a down set of the poset 
P if for each x € D all elements y & U preceding x (i.e., y -< x) also belong to 
D. We consider a directed acyclic graph Q whose nodes are all downsets of P. 
For two nodes D\ and D2 there is an edge (Di,D2) if there exists x 6 U\D\ 
such that Z?2 = Di U {a;}. An example of a poset and its graph of downsets is 
shown in Fig. El where U = {u , u%, u%, u 3 }. 

Let d(D) denote the number of linear extensions of the poset (D, -<) which 
is the poset P reduced to the down set D. Let u{D) denote the number of 
linear extensions of the poset (U \ D, -<) which is the poset P reduced to the 
complementary set of the down set D. We have [1] 



where the sum is taken over all edges {X, D) in the graph Q incoming to the 
node D. We assume d(0) = 1. Observe that d(U) = e(P). All values of d{D) 
are computed using the DFS in the graph Q, starting at the node U and going 
down, i.e., in the opposite direction to the edges. Similarly, it holds [1] 



where the sum is taken over all edges (D, X) in the graph Q outgoing from the 
node D. We assume u(U) = 1. Observe that u{%) — e(P). All values of u(D) 
are computed using the second DFS in the graph Q, starting at the node and 
going up. Values of d{D) and u(D) for the graph in Fig. [6] are shown in Fig. 
The curly braces are omitted for clarity, e.g., instead of d({uo}) we write d(uo). 
The table t can be computed from the equation 



d(D) = ]T d(X), 



(X,D) 



u 



(D) = ^ <X), 



(D,X) 



t[j,k] - d(V)u(W), 



(V,W) 
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k 

1 2 


3 







2 5 


4 


I— 1 


3 


5 


5 


J 2 








2 


3 


1 


3 





where the sum is taken over all edges (V, W) in the graph Q such that W — 
VVJ{uj} and € [7\W. For a proof see pQ. This computation is done altogether 
with the second DFS. For the graph in Fig. [6] the values t[j, k] are included in 
Tabled 

For a given poset on an n-element set its graph of downsets can have up 
to 2™ nodes. We implemented the graph as a table of the size 2™. The table is 
indexed by downsets. The index is the characteristic function of the set D, i.e., 
the index is the n-bit number, where bit j is set iff Uj G D. Graph Q is not 
constructed explicitly. When we proceed a node D all incoming and outgoing 
edges are easily computable from a poset representation. We hold at position D 
in the table only two numbers d(D), u{D) and visited time stamp v(D) needed 
to implement the DFS. We initialize the table only once at the beginning of the 
program by setting all v(D) = 0. We also hold the global visited time stamp 
Vt initialized to 0. Starting a new DFS we increment the time stamp w t . If we 
proceed a node D and Vt > v{D) then it means that the node D was not 
yet visited in the current DFS run. If vt = v(D) then the node was already 
visited. We do not need to reinitialize the table before the next DFS. This is 
very important and decreases running time. The algorithm is very efficient for 
small n, because with a high probability the whole graph resides in a processor 
cache memory. 
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